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Abstract 

Background: Cooperio oncophora and Ostertagia ostertagi are among the most important gastrointestinal 
nematodes of cattle worldwide. The economic losses caused by these parasites are on the order of hundreds of 
millions of dollars per year. Conventional treatment of these parasites is through anthelmintic drugs; however, as 
resistance to anthelmintics increases, overall effectiveness has begun decreasing. New methods of control and 
alternative drug targets are necessary. In-depth analysis of transcriptomic data can help provide these targets. 

Results: The assembly of 8.7 million and 1 1 million sequences from C. oncophora and 0. ostertagi, respectively, 
resulted in 29,900 and 34,792 transcripts. Among these, 69% and 73% of the predicted peptides encoded by C. 
oncophora and 0. ostertagi had homologues in other nematodes. Approximately 21% and 24% were constitutively 
expressed in both species, respectively; however, the numbers of transcripts that were stage specific were much 
smaller (-1% of the transcripts expressed in a stage). Approximately 21% of the transcripts in C. oncophora and 22% 
in 0. ostertagi were up-regulated in a particular stage. Functional molecular signatures were detected for 46% and 
35% of the transcripts in C. oncophora and 0. ostertagi, respectively. More in-depth examinations of the most 
prevalent domains led to knowledge of gene expression changes between the free-living (egg, L1, L2 and L3 
sheathed) and parasitic (L3 exsheathed, L4, and adult) stages. Domains previously implicated in growth and 
development such as chromo domains and the MADF domain tended to dominate in the free-living stages. In 
contrast, domains potentially involved in feeding such as the zinc finger and CAP domains dominated in the 
parasitic stages. Pathway analyses showed significant associations between life-cycle stages and peptides involved 
in energy metabolism in 0. ostertagi whereas metabolism of cofactors and vitamins were specifically up-regulated 
in the parasitic stages of C. oncophora. Substantial differences were observed also between Gene Ontology terms 
associated with free-living and parasitic stages. 
(Continued on next page) 
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(Continued from previous page) 

Conclusions: This study characterized transcriptomes from multiple life stages from both C. oncophora and 0. 
ostertagi. These data represent an important resource for studying these parasites. The results of this study show 
distinct differences in the genes involved in the free-living and parasitic life cycle stages. The data produced will 
enable better annotation of the upcoming genome sequences and will allow future comparative analyses of the 
biology, evolution and adaptation to parasitism in nematodes. 
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Background 

Studies on the free-living nematode Caenorhabditis 
elegans have provided a wealth of information on meta- 
zoan biology and development. However, being a mem- 
ber of the Nematoda has periodically engendered 
erroneous assumptions that C. elegans is a measurable 
representative of other nematodes within this phylum. 
More recent studies on the genomes and transcriptomes 
of other nematodes have demonstrated the extensive 
diversity within this group and the need to look more 
closely at individual genera to begin addressing questions 
related to nematode parasitism and host-parasite 
relationships. 

Cooperia oncophora and Ostertagia ostertagi are two 
parasitic nematodes of the order Strongylida that belong 
to the same phylogenetic clade as C. elegans [1]. Both 
species are parasites of bovids in more temperate 
regions of the world. The diseases caused by these 
nematodes are among the most costly to the cattle in- 
dustry where hundreds of millions of dollars are lost 
each year in lower productivity and higher management 
expenses. Treatment of cattle infected with these 
strongylid nematodes commonly involves anthelmintic 
drugs; however, similar to what has been observed in 
many microorganisms, drug resistance has become a sig- 
nificant problem within this group of parasites [2]. In 
spite of their economic impact, a dearth of information 
is available on their molecular biology. 

Parasites of the genera Cooperia and Ostertagia as well 
as other Strongylida exhibit similar life cycles that begin 
with fertilized eggs being passed in the host feces. Like 
C. elegans, the first three larval stages (LI, L2, and L3) 
are considered free-living because they are environmen- 
tally-exposed but with no host dependency. The infect- 
ive L3 has a protective sheath (L3sh) that allows for 
movement on pasture while protecting the parasite from 
ecological pressures. Upon ingestion, however, the 
nematodes become host-dependent i.e. parasitic; the L3 
exsheath (L3ex), develop to the fourth larval stage (L4) 
and continue development to adults in the abomasum 
{Ostertagia) or the intestines {Cooperia). Despite their 
biological similarities, infection by O. ostertagi does not 
confer strong immunity against reinfection except in cat- 
tle which have been infected for extended periods of 



time [3]. Cattle which have been infected by C. 
oncophora, on the other hand, attain resistant to reinfec- 
tion more readily [4]. Furthermore, even though cattle 
are often found simultaneously-infected with both spe- 
cies, anthelmintic resistance has only been documented 
in Cooperia spp. 

Deciphering the underlying biological differences be- 
tween these two similar organisms may open the path 
for more holistic hypotheses on host-parasite relationships, 
host immunity, and the development of drug resist- 
ance. Detailed and comparative explorations of their 
transcriptomes and genomes would not only provide 
insights into fundamental biological processes, but 
underpin the discovery of new treatments and con- 
trol methods that may be broadly applicable to other 
less similar nematodes. Although limited transcriptomic 
information is available for two developmental stages of 
O. ostertagi, [5] this falls woefully short of representing 
the entire life cycle and providing insights into what dif- 
ferentiates the free-living and parasitic stages. Currently, 
no transcriptomic data are publicly available for C. 
oncophora. Analysis of transcriptome data and their com- 
parison with genomic data is well known to provide useful 
information about an organism [5-7]. This approach has 
led to studies on identifying new drug targets (e.g. [8-10]), 
understanding nematode biology [11], and detecting para- 
site protein-specific indels and evaluating their import- 
ance in parasitism and evolution [12], to name a few. 

The present study has generated extensive, stage-related 
information on the transcriptomes of C. oncophora and O. 
ostertagi. The comprehensive comparative transcrip- 
tomic analysis of stages representing the entire life 
cycles of these animals established gene expression 
patterns which characterize and delineate among each 
of the stages investigated. In addition, transcripts which 
are unique to free-living or parasitic stages have also been 
discovered. The resources and results in this study 
provided molecular insights that improve our under- 
standing of parasite biology and development, and identi- 
fied differential transcripts among stages/sexes. In broader 
terms, these analyses will be extremely useful for annotat- 
ing their upcoming genomes [13] and could enable the 
development of new methods to control infections by 
these parasites. 
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Results 

Transcript reconstruction and homologs in other 
nematodes 

Sequencing of the transcriptomes of C. oncophora and 
O. ostertagi resulted in 9,603,581 and 11,900,750 reads 
and 29,900 and 34,792 assembled transcripts and 
corresponding peptide translations, respectively (Table 1). 
These transcripts represent an estimated 81% and 74% 
of the complete transcriptomes (defined by detection of 
the conserved low copy eukaryotic genes [14]) wherein 
202 and 184 CEGs were detected in these two species, 
respectively. The transcript consensus sequences are 
available at http://nematode.net [15]. The number of 
transcripts likely over estimates gene discovery, as one gene 
could be represented by multiple non-overlapping tran- 
script fragments. Such 'fragmentation' [16], was estimated 
at 21% for C. oncophora and 22% for O. ostertagia. 

Sequence homologues for 68% of the predicted 
peptides of C. oncophora and 73% of those of O. 
ostertagi were found in at least one other nematode spe- 
cies (Figure 1). Approximately half of these homologues 
were common to sequences in all nematodes examined 
{see Materials and Methods). Strongylids had the largest 
subset of group specific homologues, while non-strongylid 
parasite species had the fewest (Additional file 1). Peptides 
predicted to be species-specific were significantly shorter 
in length, on average, than peptides with matches in other 
species (Additional file 2: Figure SI). This explains in part, 
the perceived sequence specificity in lieu of finding 
homologs as reported previously [17]. 

Transcript profiles throughout the C. oncophora and O. 
ostertagi life cycle stages 

On average, 35% of the transcripts of a given stage are 
constitutively expressed in that specie, and this was true 
for both species (Figure 2 A, 2B and Additional file 3). In 
C. oncophora, 21% are found in all stages, whereas 24% 



Table 1 Summary of generated reads, assembled 
trascripts and annotation information 





C. oncophora 


O. ostertagi 


Total number of 454 Reads 


9603581 


11900750 


Number of Reads Removed 


859727 


821428 


Number of Reads after Contamination 
Screening 


8743854 


11079322 


Number of Reads after Clustering 


3713617 


7079583 


Number of Mapped Reads 


6588676 


8102342 


CEG's 


202 


184 


Number of Predicted Peptides 


29900 


34792 


Number of peptides with InterPro match 


13812 


12274 


Number of peptides with Pfam match 


12311 


14317 


Number of peptides with GO match 


10511 


16130 




Figure 1 Distribution of protein homologues in free-living 

nematodes, Strongylida parasites, and Non-Strongylida 

parasites. The percent of homologues in each of the three 

databases as well as the overlap between the databases is shown. 

(A). C. oncophora; (B). O. ostertagi. For species included in each of 

the three databases please see the Materials and Methods. 
^ ) 

are found in all stages of O. ostertagi. The KEGG 
pathways analysis suggests that the majority of these 
transcripts are involved in genetic information 
processing and in particular, transcription and translation 
(Additional files 4 and 5); the InterPro (IPR) domains 
encoded by these transcripts confirm their associations 
with these functions (Additional file 6 and 7). One of the 
most prevalent domains in constitutively-expressed 
transcripts in both species is ubiquitin-associated/transla- 
tion elongation factor (IPR015940). 

While some of the peptides encoded by constitutively- 
expressed transcripts may not contain identifiable 
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Figure 2 Transcript expression in different developmental stages. This figure represents the number of transcripts expressed in each stage 
and the percent of those transcripts that are constitutively-expressed in C. oncophoro (A) and O. ostertogi (B). (C) C. oncophoro and (D) O. ostertogi 
depict the number of transcripts up-regulated, down-regulated and specific to a given stage. 



domains, most of them exhibit homology with other 
proteins. The majority of these peptides (88% and 90%, 
respectively) had homologs in at least one specie from 
the three phylogenetic databases to which they were 
compared, whereas 79% and 75% have homologs in all 
three databases suggesting that constitutively-expressed 
transcripts are involved in core cellular processes. As 
expected, peptides in C. oncophora and O. ostertagi had 
higher numbers of homologs among the Strongylida 
parasites (-88% in both species) than any other group; 
the fewest number were shared with the non-Strongylida 
nematodes. 

The number of transcripts expressed in only one stage 
was small (<1%; Figure 2C, 2D and Additional file 8). In 
general, transcripts expressed in the later stages i.e. adult, 
had a high number of homologs (-64%) in other species, 
whereas those expressed in the earlier stages i.e. egg, had 
fewer (25% and 17%, respectively). The parasitic stages in- 
cluding the L3sh exhibited a higher number of homologs 
in the strongylid parasites than in the other two groups of 
species, whereas more of the transcripts expressed in the 
free-living stages showed similarity with organisms in the 
two non-Strongylida groups than with those in the 
Strongylida group with the exception of the L3sh. 

Comparing stage-specific transcript expression within 
species revealed that the majority of transcripts expressed 



in each stage are not differentially-expressed (Additional 
file 8); -20% of transcripts in both species are up- 
regulated in any given stage whereas -26% are down- 
regulated. Comparative values for up- and down-regulated 
transcripts are shown in Figure 2C and 2D. On average 
74% and 68% of up-regulated transcripts have homologs 
in at least one nematode group to which they were 
compared; up-regulated transcripts had a higher number 
of homologs to Strongylida parasites only (70% for C. 
oncophora and 73% for O. ostertagi). As with the 
constitutively-expressed transcripts, translation is the 
most prevalent KEGG category in both C. oncophora 
and O. ostertagi. Most transcripts (-89% in C. 
oncophora and -93% in O. ostertagi) are up-regulated in 
more than one stage likely resulting from carryover 
between consecutive stages. 

There was a total of 1393 transcripts identified as en- 
coding putatively-secreted peptides of which 538 were 
enriched in at least one stage. It was determined that 
free-living stages tended to have more of these 
transcripts in common with each other than with the 
parasitic stages. Parasitic stages tended to have a com- 
mon pool of secreted peptides as well. The exception to 
this was C. oncophora L4 which shared more secreted 
peptides with the free-living stages than with the other 
parasitic stages. The 5% of domains most prevalent in 
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the secreted peptides were very similar between the two 
species. Transthyretin-like, metridin-like ShK toxin, 
saposin B, and CAP domains were among the most 
prevalent for secreted proteins in both species. Two in- 
sulin domains were among the most prevalent in 
secreted peptides of C. oncophora but were absent from 
O. ostertagi. Ves allergen was found in 16 secreted 
peptides of O. ostertagi but was found in only one 
secreted peptide of C. oncophora. 

Differences in gene expression and associated functions 
between free-living and parasitic stages 

Pfam domains were identified in 41% of the peptides in 
both C. oncophora and O. ostertagi matching 2507 and 
2658 different domains, respectively. In both organisms 
the most prevalent domain was RNA recognition motif 
(PF00076) (Table 2). 

An examination of transcripts expressed in the free- 
living (egg, LI, L2, L3sh) and parasitic (L3ex, L4, adult) 
stages of development revealed that some Pfam domains 
are abundant in both phases of development while 
others are unique to a single stage or phase. The most 

Table 2 The three most abundant Pfam, InterPro, and GO 
terms associated with the peptides 

Number of peptides 

Description G O. 

oncophora ostertagi 



Pfam Code 
PF00076 
PF00069 
PF01060 

IPR code 
I PR0 16040 
IPR01 1 009 
IPR012677 



RNA recognition motif 
Protein kinase domain 
Transthyretin-like family 

NAD(P) - binding domain 

Protein kinase-like domain 

Nucleotide-binding alpha-beta 
plait 



GO code 

Biological process 

GO:0006508 oxidation-reduction process 

GO:0008152 metabolic process 

GO:0006457 proteolysis 
Cellular component 

GO:0005622 intracellular 

GO:0016020 membrane 

GO:0016021 integral to membrane 
Molecular Function 

GO:0005515 protein binding 

GO:0005524 ATP binding 

GO:0003824 Catalytic activity 



208 
170 
164 

269 
252 
244 



759 
713 
492 

745 
575 
544 

1537 
877 



226 
192 
169 

336 
271 
284 



842 
579 

769 
729 
626 

1735 
1050 



abundant Pfam domain (chromo domain) in the free- 
living stages of C. oncophora was expressed solely in this 
phase of development while two of the top three 
domains (Lectin C-type domain and trypsin) in the para- 
sitic stages were not expressed in any of the free-living 
stages (Additional files 9 and 10). Domains like the RNA 
recognition motif were found equally in both phases. 

A total of 35% of C. oncophora peptides and O. 
ostertagi peptides could be associated with GO terms 
categorized as 'biological process', cellular component', 
and/or molecular function' (Table 1 and Additional files 
11, 12, 13, 14, 15 and 16). Examination of GO terms 
associated with the peptides reveals significant differences 
between parasitic and free-living stages (Figure 3). 
Significantly-enriched molecular functions in the para- 
sitic stages of O. ostertagi and C. oncophora included 
binding (GO:0005488), protein binding (GO:0005515), 
and catalytic activity (GO:0003824). In the free-living 
stages, sodium:potassium-exchanging ATPase activity 
(GO:0005391) and aspartic-type endopeptidase activ- 
ity (GO:0004190) were enriched in C. oncophora 
while oxygen binding (GO:0019825) and sequence 
specific DNA binding (GO:0043565) were enriched 
in O. ostertagi, 

A total of 4,160 and 4,135 unique InterPro domains 
were detected in 46% of C. oncophora and 35.3% of O. 
ostertagi peptides with the most prevalent domain being 
'NAD (P)-binding' domain (Table 2). In the free-living 
stages, globin, zinc finger domains, and chromo domains 
were among the most prevalent (Figure 4 and Additional 
files 17 and 18). In the parasitic stages, metridin-like 
ShK toxin, CAP domain, and C-type lectins were among 
the most prevalent motifs (Figure 4). Clustering based 
on the number of IPR domains found in up-regulated 
peptides revealed that consecutive stages tend mainly to 
cluster together with the exception of peptides from the 
egg (Figure 5). In both species, the domains found in 
these peptides tend to be linked to the adult stage, which 
is likely due to the presence of fertilized eggs in the 
adults. 

C. elegans had 8,896 proteins with RNAi phenotypes in 
the stages analogous to free-living C. oncophora and O. 
ostertagi, and 8,205 proteins in the parasitic stages (i.e. 
post dauer). C. oncophora had 29 polypeptides from the 
free-living stages and 68 from the parasitic stages with 
homologs to the C. elegans genes with available RNAi 
phenotypes, whereas O. ostertagi shared 53 homologous 
polypeptides from free-living stages and 120 polypeptides 
from the parasitic stages, with C. elegans genes of known 
RNAi phenotype. For most RNAi phenotypes inferred, 
there were no significant differences between the numbers 
of polypeptides in the two species and the numbers of 
proteins in C. elegans that exhibited those phenotypes. C. 
oncophora had significantly more peptides with predicted 
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Figure 3 GO term associations with transcripts expressed in each stage. For each phase of the life cycle (free-living or parasitic) several 
prevalent GO terms are listed. * indicates a given term is significantly-enriched in that life cycle stage (p < 0.05). (A) C. oncophora; (B) 0. ostertagi. 



RNAi growth phenotypes (p = .007) in the parasitic stages 
when compared to C. elegans. In contrast, O. ostertagi 
exhibited a significantly greater number of peptides with 
larval lethal phenotypes (p = 2.8E-05) in the parasitic 
stages (Figure 6) relative to C. elegans. 

Comparison of the up-regulated transcripts to the 
KEGG pathways revealed an increase in the number of 
transcripts involved in metabolism of cofactors and 
vitamins in the parasitic stages of C. oncophora (p = 0.04). 
In the free-living stages of O. ostertagi, there were signifi- 
cantly (p = 0.01) more transcripts involved in energy me- 
tabolism when compared to the parasitic stages (Figure 7). 

Discussion 

The gastrointestinal parasites studied here exhibit nu- 
merous biological similarities. They begin their lives as 
eggs that are passed in the feces from the host. They re- 
main as free-living organisms up to and including the 
L3sh at which time they are ingested by the host, ex- 
sheath and then continue their development as parasitic 
organisms within the host. Examination of transcripts in 
both species revealed that 68.8% in C. oncophora and 
73.0% in O. ostertagi have sequence homologues in the 
other species examined in this study (Figure 1) and that 
-60% of strongylid genes have homologs in C. elegans 
[7,18]. While we have identified few peptides (0.2% and 
0.3%, respectively) that share homology only to non- 
Strongylida species (Additional file 1), mainly Ascaris. 



suum and Brugia malayU these are likely homologous 
peptides not yet identified in other Stongylida species 
because of the incomplete nature of their genome 
sequences. Our study showed similar results in that 
BLAST searches identified homologous sequences in 
55.1% of C. oncophora and 57.9% of O. ostertagi 
polypeptides when compared with free-living nematodes. 
The slightly higher percentages observed in this study 
can be attributed to the better coverage of the Cooperia 
and Ostertagia transcriptomes using pyrosequencing 
relative to the coverage obtained from conventional EST 
libraries in previous investigations (e.g. [5]). Because of 
differences in the environments and living requirements 
between the free-living and parasitic stages, it is 
expected that some pathways and enzymes will be 
unique to these two phases of development and coincide 
with the requirements and challenges imposed by the 
different environments. Comparisons of domains and 
pathways present in the free-living stages to those in the 
parasitic stages revealed many of these differences. 

Given the similarities between C. oncophora and O. 
ostertagi, it was not unexpected that there would be sig- 
nificant overlap in the domains found in up-regulated 
peptides in the various stages. For example, among the 
20 most abundant domains in all stages, ten were identi- 
cal in both organisms. The domains that were prevalent 
in the free-living vs. parasitic stages may provide clues 
to the lifestyles and environments in which these 
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Figure 4 Significantly-enriched (p < 0.05) InterPro domains in the free-living (egg, LI, 12, L3sh) or parasitic (L3ex, L4 and adult) stages 
of (A) C. oncophora and (B) O. ostertagi. 



organisms live. In the free-living stages, domains previ- 
ously implicated in growth and development tended to 
dominate. In C. oncophora three different chromo 
domains (IPR000953, IPR016197, and IPR023780) and 
the MADF domain (IPR006578) were enriched (Figure 4, 
Additional file 17). Chromo domains are often found in 
association with heterochromatin protein 1 (HP1) which 
functions in germline and vulval development in C. 
elegans [19]. The MADF domain is a transcription factor 
in Drosophila that activates genes necessary for develop- 
ment [20]. Chromo domains and MADF domains were 
found in proteins that predominate in the egg as would 
be expected (Additional file 18). Interestingly, the 
chromo domain (IPR000953) and MADF domain 



(IPR006578) were also found elevated in adult O. 
ostertagi. Two domains identified as basic leucine 
zippers [21] (IPR004827 and IPR011700) were up- 
regulated in the free-living stages of O. ostertagi. 

As the organisms transition to LI, the domain preva- 
lence shifts as well. In C. oncophora, the most prevalent 
domain was EF-hand-like domain (IPR011992). This 
domain tends to be found in calcium binding proteins 
[22], In contrast, the most prevalent domain in O. 
ostertagi was globin (IPR0 12292). Globin and saposin 
domains were prevalent in the L2 of both species. Both 
of these domains were found in secreted peptides of 
both species. Saposin domains are expressed in all stages 
of Ancylostoma caninum [23]. While they were not 
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Figure 5 Clustering of stages based upon the number of transcripts in a stage containing a specific InterPro domain. (A) C. oncophora; 
(B) O.ostertogi. A lower-range scale (0 to 3+) was used to better illustrate the similarities and differences between the stages. 



found in enriched peptides in every stage of C. 
oncophora or O. ostertagi, these domain containing 
peptides were expressed in all stages. 

During the L3sh, the worms both protect themselves 
from environmental stress as well as prepare for uptake 
by and development within the host. Among the most 
prevalent domains in the L3sh were protease inhibitor 18 
(IPR002919) and late embryogenesis abundant (LEA) 
protein (IPR004238) in C. oncophora and O. ostertagi, 
respectively. Among the multitude of roles played by 
protease inhibitors, it has been suggested that they are 
also involved in protecting invading organisms from host 
molecules, in particular, those derived from the gastro- 
intestinal tract, such as pepsin. In this way gastrointes- 
tinal nematodes can safely navigate and survive within 
the host digestive tract [6]. Late embryogenesis abundant 
proteins have been shown to play a role in protection 
from the environment. In Aphelenchus avenae (plant 
pathogenic roundworm), LEA proteins help protect other 
proteins from aggregating during times of low water and 
possibly play a role in preventing desiccation [24] . 

During the parasitic stages beginning with the L3ex, it 
is expected that transcriptional profiles will shift towards 
host interaction while maintaining those profiles 
associated with worm development. Zinc finger domains 
(IPR007087 and IPR015880) which are important in cell 
differentiation and development [25] were indeed among 
the most prevalent domains in the L3ex of C. oncophora 
and in O. ostertagi adults possibly resulting from add- 
itional rapid growth as the worms emerge from the gas- 
tric glands. In O. ostertagi L3ex, the most prevalent 
domains found in the greatest number of peptides, were 
DUF148 and metridin-like ShK toxin. The metridin-like 



ShK toxin domain (IPR003582) was up-regulated in O. 
ostertagi parasitic stages and was the most prevalent 
domain in the L4 stage. Noteworthy is that the 
metridin-like ShK toxin domain is often found near the 
C-terminus of C. elegans metallopeptidases. It is sug- 
gested that these domains are important in parasitic 
interactions [26]. CAP domains (IPR014044) were also 
among the most prevalent domains in C. oncophora 
L4 and O. ostertagi adults (Figure 4 and Additional 
files 17 and 18); however, among putatively-secreted 
peptides, CAP domains were observed in C. oncophora 
L3sh, L4, and adults, and in O. ostertagi L4. In mammalian 
species, proteins harboring CAP domains are divided into 
nine subfamilies which encompass cysteine-rich secretory 
proteins (CRISPs). Similar CRISP domains (PF00188) were 
up-regulated in Ostertagia (Additional file 10) and have 
recently been identified in the Lethenteron japonicum 
(parasitic lamprey) [27] which secretes a CRISP containing 
protein from its buccal glands once it has attached to the 
host. It is believed that this CRISP protein enhances vaso- 
dilation and feeding [28] . It should be noted that the con- 
cept of "secretory proteins" is defined as a cellular event 
and not necessarily a function related to parasites 
secretions. As such, there need not be a direct relationship 
between CRISP proteins and "extraorganismal" function- 
ality i.e., parasite secretory products. Case in point, in 
mammals, CRISP proteins are well known to be associated 
with cell signaling, reproduction, fertilization and the 
maturation of spermatozoa. As such, it may not be coinci- 
dental that in parasites, an abundance of CRISP proteins 
is associated with the later larval and adult stages of worm 
development. CRISP domains have been found associated 
with proteins with immunomodulatory activity [29] and 



Heizer et al. BMC Genomics 2013, 14:1 18 
http://www.biomedcentral.eom/1 471 -21 64/1 4/1 1 8 



Page 9 of 14 




RNAi phenotype 



(B) 




RNAi phenotype 



Figure 6 Comparison of phenotype distribution between RNAi- 
surveyed C. elegans genes, and C. onchophora and 0. ostertagi 
homologues to C. elegans genes with similar phenotypes. (A). 

The percent of C. oncophoro peptides encoded by transcripts 
expressed in free-living stages (egg, L1, L2 and L3sh) with homologs 
to C. elegans proteins with various RNAi phenotypes. (B). The 
percent of 0. ostertagi peptides encoded by transcripts expressed in 
parasitic stages (L3ex, L4 and adult) with homologs to C. elegans 
proteins with various RNAi phenotypes. "*" indicates that for a 
specific RNAi phenotype, significantly more (p < 0.05) peptides in 
either C. oncophora or 0. ostertagi exhibited that phenotype than 
did C. elegans. 



have been studied in few parasitic nematode species in- 
cluding the hookworm A caninum [30] and the murine 
strongylid nematode, Heligmosomoides polygyrus [31]. 

It is well known that proteins such as chymotrypsin, 
trypsin, and peptidase [32] are involved in the 
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Figure 7 Distribution of KEGG categories associated with up- 
regulated transcripts. The number of up-regulated transcripts in 
free-living and parasitic life stages associated with KEGG categories is 
compared in (A) C. oncophora and (B) O. ostertagi. * indicates 

significance (p <0.05). 
v J 

breakdown of proteins into constituent parts. Chymo- 
trypsin (IPR001254) domains were up-regulated in the 
parasitic stages of C. oncophora and found only in the 
parasitic stages of O. ostertagi (Figure 4 and Additional 
file 17 and 18); trypsin-like domains (IPR009003) were 
up-regulated in C. oncophora, and; peptidase S1/S6 
(IPR001254) was one of the most prevalent domains in 
female C. oncophora (Figure 4 and Additional file 17). 
Given their abundance in the later stages of develop- 
ment (e.g. L4 and adult), it is possible that proteins 
associated with these domains collectively play a role in 
the feeding process [24] . This is supported in part by the 
observation that these domains are present in nine 
secreted peptides in C. oncophora and 75 in O. ostertagi. 
It is possible that a subset of these is not only secreted 
from the cell but also from the parasite. Given that the 
adult diets of these parasites vary based upon either 
abomasal (Ostertagia) or intestinal (Cooper ia) contents, 
these secreted proteases may also participate either in 
countering the host immune responses (L4) by hydrolyzing 
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antibodies, or in establishment in the host (L3 ex) particu- 
larly as it relates to Ostertagia and its need to enter the 
gastric glands and keep inflammation at bay. 

The three C-type lectin domains (IPR001304, IPR 
016186, and IPR016187) were the most prevalent 
domains in male C oncophora (Additional file 17) and 
were up-regulated as well in O. ostertagi (Additional file 
18). As expected, all three of these domains are found in 
putatively secreted peptides in both species predomin- 
antly because evolutionarily, the superfamily of proteins 
containing C-type lectin domains is comprised of 
extracellular metazoan proteins with diverse functions. 
In general, these domains are involved in calcium 
dependent carbohydrate binding. However, it should also 
be noted that not all proteins containing C-type lectin 
domains can actually bind carbohydrates or even Ca 2+ . 
Indeed, most of the proteins containing this domain and 
referred to as C-type lectins are not lectins. Nonetheless, 
those with functionality have been implicated in innate 
immune responses in invertebrates, and have been 
linked to proteins involved at the host-parasite interface 
which may assist in evading the host immune response 
[33]. As such, differences in the levels of these domains 
between C. oncophora and O. ostertagi may in part be 
associated with the observed variation in host immunity 
as well as distinction in the predilection sites of the re- 
spective L4s and adult worms. A closer investigation of 
sequence similarity to C-type lectins from free-living 
and parasitic nematodes and an analysis of the locus to 
which these proteins are eventually translocated might 
shed light on physiological functionalities as they relate 
either to sustaining life within the organism or control- 
ling the host-pathogen interface. Some nematode C-type 
lectins have been linked to the parasite surface i.e., the 
epicuticle. Among other things, the nematode cuticle is 
comprised of collagen proteins and these proteins ex- 
hibit stage specific expression [16,34]. 

Examination of KEGG categories demonstrated signifi- 
cant associations between life cycle stages and peptides 
involved in energy metabolism in O. ostertagi where 24 
peptides were found in the free-living stages and only 
four in the parasitic stages (Figure 7). Further analysis of 
these 24 polypeptides provided clues about environmen- 
tal adaptation. From the egg stage through L2, the 
worms are present in the fecal pat. Upon developing to 
L3sh they become more motile and migrate from the 
pat to better position themselves for ingestion by the 
host. Of the 24 peptides involved in energy metabolism 
in the free-living stages of development, 17 are as- 
sociated with methane metabolism. As the free-living 
stages of both species are found in the fecal pat and the 
fecal pat is a methane rich environment, this is not sur- 
prising. Only one of the 24 peptides is up-regulated in 
the L3sh and classified as an enzyme involved in 



oxidative phosphorylation rather than methane metabol- 
ism. It is possible that this becomes more functional as 
the worm distances itself from the fecal pat and readies 
itself for ingestion by the host. It is also interesting to 
speculate that environmental queues i.e. host GI tract, 
may down-regulate transcriptional activity of the 
proteins involved in methane metabolism and in turn in- 
duce exsheathment and worm development. 

In C. oncophora, the KEGG category "metabolism of 
cofactors and vitamins" was significantly more abundant 
in the parasitic stages than in the free-living stages 
(Figure 7). The specific enzymes involved are associated 
with pantothenate and Co A biosynthesis [35], and 
thiamine metabolism. All three peptides were up- 
regulated only in adult females. Inasmuch as these 
enzymes were not observed in abundance in fecal eggs, 
their functions are likely related specifically to females 
or to egg development in utero. 

While many of the transcripts were stage specific, 
others were expressed in all stages. These constitutively- 
expressed transcripts are likely involved in core molecu- 
lar processes used to sustain life, as shown by the 
domains found within them. This conclusion is also 
bolstered by the embryonic lethal phenotypes predicted 
for the majority of the constitutively-expressed trans- 
cripts that link to an RNAi phenotype in C. elegans. 
These transcripts and their encoded proteins should 
make attractive drug targets provided sufficient variation 
can be found between parasite and host proteins. 

Conclusions 

Control of parasitic nematodes is routinely accomplished 
through anthelminthic drugs. Resistance to these drugs 
is increasingly becoming a problem especially in live- 
stock hosts. To date, resistance has surfaced to nearly all 
commercially available drugs [36]. In an effort to better 
understand this resistance and help combat the higher 
production costs associated with the lack of efficacy, a 
detailed study of these parasites at a molecular level was 
conducted. To this end, we have generated comprehen- 
sive data on the transcriptomes of all discernible life 
cycle stages of these two organisms. The genome 
sequences for C. oncophora and O. ostertagi have been 
initiated in an effort to complement and complete this 
work (http://www.genome.gov/10002154) [13]. The 
cDNA sequences generated in this study will enable bet- 
ter annotation of these genomes upon completion. In 
the current study, many differences were revealed be- 
tween the free-living and parasitic stages of these 
nematodes when examined at the domain, process and 
pathway levels. During the free-living stages of develop- 
ment, peptides and pathways involved in growth and de- 
velopment were more prominent. In contrast, peptides, 
domains and pathways that traditionally function in the 
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degradation of proteins were more prevalent during the 
parasitic stages. These differences are likely associated 
with host adaptation and therefore parasitism. Further 
in-depth examination of the differences in domain 
prevalence and expression between the free-living and 
parasitic stages may reveal conservation in genes linked 
to infection, host recognition, immune response and dis- 
ease. Equally important is understanding the similarities 
between evolutionarily related organisms in the hope of 
detecting biological and molecular threads that link the 
parasitic stages. In this way, we may better identify targets 
for the development of new classes of nematocides. 
Holistic approaches such as this could extend new 
treatments to human pathogens as well. 

Methods 

Sample preparation, library construction, and sequencing 

Ostertagia ostertagi eggs were purified from the feces of 
calves infected with O. ostertagi by sequentially sieving 
diluted fecal material over 400, 150 and 64 \im sieves, 
and finally collecting the eggs on a 37 \im sieve. To col- 
lect LI, the eggs were incubated for 24 h at 23°C in tap 
water after which the larvae were purified by 
baermannization. The L2 were collected by culturing the 
feces for 5 days at 23°C followed by baermannization. 
The larvae were confirmed to be L2 by measuring them 
under the microscope. The L3 sheathed (L3sh), L3 
exsheathed (L3ex) and L4 were prepared as previously 
described [37]. 

Adult parasites of O. ostertagi were microscopically- 
selected from abomasal contents from animals killed 28 
days post infection. Cooperia oncophora eggs, LI, L2, 
L3sh and L3ex were also collected as described above. 
The L4 were obtained by baermannization of intestinal 
contents and washings from animals euthanized 10 days 
post infection; adult worms were microscopically collected 
from animals euthanized 21 days post infection and fur- 
ther partitioned into male [M] and female [F] worms. 

Total RNA was prepared by homogenizing all parasite 
samples in Trizol (Invitrogen). All RNA samples were 
DNAse treated prior to mRNA isolation and sequencing. 
The integrity and yield of the RNA was verified by the 
Bioanalyzer 2100 (Agilent Technologies, Cedar Creek, 
Texas). Total RNA (3 \xg) was treated with Ambion 
Turbo DNase (Ambion/ Applied Biosystems, Austin, 
TX). Approximately 1.4ug male and 2.7 \ig female total 
RNA were used as the templates for cDNA library con- 
struction using the Accuscript HF Reverse Transcriptase 
Kit (Agilent Technologies, Cedar Creek, Texas) and 
SMART primers (Invitrogen, Carlsbad, CA). PCR cycle 
optimization was performed to determine the minimum 
cycle number to amplify full-length cDNA products 
using the SMART primers and Clontech Advantage-HF 
2 polymerase Mix (Clontech/Takara Bio, Mountain 



View, CA). Amplification was carried out for 30 cycles 
for the male sample and 27 cycles for the female sample. 
PCR cycle optimization was performed with normalized 
cDNA to determine the threshold cycle number using 
the SMART primers and Clontech Advantage-HF 2 
polymerase mix previously mentioned. The determined 
number of cycles was 14 for both the male and female 
samples. Finally, 5' and 3' adaptor excision was per- 
formed by digestion with Mmel. The excised adaptors 
were removed utilizing AMPure paramagnetic beads 
(Agencourt, Beckman Coulter Genomics, Beverly, MA). 
Five micrograms of the cDNA was run on a 0.8% GTG 
Seakem agarose gel for size selection (Lonza, Basel, 
Switzerland). Fragments in the 300-800 bp size range 
where end polished and ligated to 454 Titanium library 
adaptors utilizing reagents from the Titanium General 
Library Kit (Roche 454). An AMPure (Agencourt, 
Beckman Coulter Genomics, Beverly, MA) bead cleanup 
was performed to remove library adaptor dimers and 
cDNA fragments less than 300bp in length. The library 
was immobilized with Strepavidin beads (Library 
Immobilization Bead and Buffer Kits from 454 Roche) 
and single stranded with 0.1 25N Sodium Hydroxide. 
The single-stranded library was quantitated by a Quant- 
it single stranded DNA assay using the Qubit 
(Invitrogen, Carlsbad, CA) and the integrity validated 
using the Bianalyzer 2100 (Agilent Technologies, Cedar 
Creek, Texas). The library fragments were immobilized 
onto DNA capture beads supplied in the 454 Titanium 
Clonal Amplification kits. (Roche 454, Branford, Con- 
necticut) The captured DNA library was emulsified and 
subjected to PCR in order to amplify the DNA template. 
The emulsion was chemically broken and the beads 
containing the DNA were recovered and up-regulated 
utilizing bead recovery reagents (Roche 454, Branford, 
Connecticut). The DNA library beads were loaded onto a 
PicoTiterPlate device and sequenced on the Genome Se- 
quencer 454 Titanium instrument using the GS FLX titan- 
ium Sequencing Kit (Roche 454, Branford, Connecticut). 

Analytical processing of the reads, assembly and 
comparative analysis 

cDNA sequence data for C. oncophora (egg, LI, L2, 
L3sh, L3ex, L4, adult males [M], and adult females [F]) 
and O. ostertagi (egg, LI, L2, L3sh, L3ex, L4, and adult) 
were screened for adaptor sequences using Seqclean 
http://compbio.dfci.harvard.edu/tgi/. The reads were 
then analyzed using the Newbler assembler v2.5 run- 
Mapping and those representing host contamination were 
removed from further consideration. The remaining reads 
were clustered using cd-hit-est [38] at 99% identity. The 
resulting representative reads were assembled into contigs 
using the Newbler assembler v2.5. Each stage was 
assembled individually and then the contigs were 



Heizer et al. BMC Genomics 2013, 14:1 18 
http://www.biomedcentral.com/1471-2164/14/118 



Page 12 of 14 



assembled by PHRAP (http://www.phrap.org/), using de- 
fault settings, resulting in assembled transcripts. BLAT 
[39] was utilized (75% identity over 90% of length) to map 
the 8.7 million and the 11 million C. oncophora and O. 
ostertagi reads to the corresponding PHRAP assembly for 
expression profiling. The degree of fragmentation was 
determined as previously described [16]. 

Assembled transcripts were translated utilizing pro- 
t4est [40] and are available for acquisition and searching 
at http://nematode.net [15]. Predicted peptides were 
compared to the core eukaryotic genes (CEGs) [41] 
using HMMER [42] to estimate the completeness of 
each transcriptome. Hits to the CEG s were determined 
using the suggested cutoffs [41]. Predicted peptides were 
further analyzed using InterProScan [43] using tags to 
search for InterPro domains, GO terms, and Pfam 
domains. Putative secreted peptides were determined 
utilizing Phobius [44]. Peptides containing a signal pep- 
tide for secretion and no transmembrane sequences 
were designated as putatively secreted. Analyses of puta- 
tively secreted peptides were only performed on those 
shown to be up-regulated in at least one stage. 

BLAST searches were used to compare the trans- 
criptomes of C. oncophora and O. ostertagi to either gen- 
omic or transcriptomic data from thirteen other species 
subdivided into free-living nematodes {Caenorhabditis 
elegans, C. japonica, C. remanei, C. brenneri, C. briggsae 
and Pristionchus pacificus), Strongyloid parasites 
(Dictyocaulus viviparus, Teladorsagia circumcincta, 
Trichostrongylus colubriformis and Oesophagostomum 
dentatum) and non-Strongyloid nematode parasites (A 
suuni, Trichinella spiralis, and B. malayi) (le-05, bit score 
of 50, and only the best hit). The BLAST output files are 
available at nematode.net [15]. Additional searches and 
comparisons were performed against the KEGG database 
(le-05), and against each other (le-05 and bit score of 50). 

After reads were re-aligned to the transcripts using 
BLAT, the depth of coverage of each contig was 
calculated by dividing the lengths of all reads contribut- 
ing to a contig by the length of the contig (Additional 
files 19 and 20). The coverage of a specific contig was 
then compared between the various stages using a bino- 
mial distribution and a p- value of 0.01 to determine the 
enrichment or depletion of reads. The hypergeometric 
function identifies nearly identical contig lists as EdgeR 
[45], but is much more lenient in significance cutoffs, 
resulting in more transcripts being identified as 
differentially-expressed (summary of the comparison is 
provided in Additional file 21 and the list of all 
differentially-expressed transcripts using each of the 2 
methods is provided as Additional file 22). The up- 
regulated reads were grouped depending on whether 
they came from a free-living stage (egg-L3sh) or a para- 
sitic stage (L3ex-adult). Prevalence of InterPro domains 



[46], GO categories [47], Pfam domains [48], KEGG [49] 
categories, and RNAi phenotypes was compared 
between the free-living and parasitic stages utilizing a G- 
test (p > 0.05). 

Putative RNAi phenotypes were determined by com- 
paring sequences derived herein to known C. elegans 
RNAi phenotypes as listed on WormMart [50]. In order 
to compare the C. elegans RNAi phenotypes to the free- 
living and parasitic stages of the nematodes in this study, 
the proteins in C. elegans were subdivided into two 
groups; all stages from the egg to the L3 dauer were 
considered akin to the "free-living" stages while dauer 
exit to adult worms were equated to the "parasitic" 
stages. If a polypeptide had multiple phenotypes, only 
the most severe was utilized in order of decreasing le- 
thality i.e., embryonic lethal > larval lethal > sterile, 
growth > embryonic non-lethal > other. Identification of 
significant differences in categorical RNAi phenotype 
numbers between C. elegans and either C. oncophora or 
O. ostertagi was performed using a G-test (p > .05). 



Additional files 



Additional file 1: C. oncophora and O. ostertagi transcripts 
homologous to non-Strongylida only. Description: The BLAST hits of 
the C. oncophora and 0. ostertagi transcripts that share homology only to 
the non-Strongylida database. 

Additional file 2: Figure SI. Length distribution of peptides with and 
without homologues in other species. Description: Histogram of the 
length of peptides that have homologues in other species and those that 
do not have homologues i.e. are unique to either C. oncophora or 0. 
ostertagi. 

Additional file 3: Constitutively-expressed transcripts in C. 
oncophora and O. ostertagi. Description: List of all transcripts 
constitutively-expressed in C. oncophora and 0. ostertagi. 

Additional file 4: KEGG categorization of constitutively-expressed 
transcripts in C. oncophora. Description: Details of the KEGG matches 
for all constitutively-expressed transcripts in C. oncophora. 

Additional file 5: KEGG categorization of constitutively-expressed 
transcripts in O. ostertagi. Description: Details of the KEGG matches for 
all constitutively-expressed transcripts in 0. ostertagi. 

Additional file 6: InterPro annotation of C. oncophora peptides. 

Description: Annotation based upon InterPro of all C. oncophora peptides. 

Additional file 7: InterPro annotation of O. ostertagi peptides. 

Description: Annotation based upon InterPro of all 0. ostertagi peptides. 

Additional file 8: Number of transcripts unique to a stage as well 
as the number up-regulated or down-regulated in a stage. 

Description: Number of transcripts uniquely-expressed in a stage or 
differentially-expressed among stages. 

Additional file 9: Prevalence of Pfam domains in C. oncophora 
(up-regulated transcripts only). Description: The number of transcripts 
in each stage identified to hit a specific Pfam domain. 

Additional file 10: Prevalence of Pfam domains in O. ostertagi 
(up-regulated transcripts only). Description: The number of transcripts 
in each stage identified to hit a specific Pfam domain. 

Additional file 11: Prevalence of GO biological processes in C. 
oncophora (up-regulated transcripts only). Description: The number of 
transcripts in each stage associated with a specific GO biological process. 
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Additional file 12: Prevalence of GO cellular components in C. 
oncophora (up-regulated transcripts only). Description: The number of 
transcripts in each stage associated with a specific GO Cellular 
component. 

Additional file 13: Prevalence of GO molecular function in C. 
oncophora (up-regulated transcripts only). Description: The number of 
transcripts in each stage associated with a specific GO Molecular 
function. 

Additional file 14: Prevalence of GO biological processes in O. 
ostertagi (up-regulated transcripts only). Description: The number of 
transcripts in each stage associated with a specific GO biological process. 

Additional file 15: Prevalence of GO cellular components in O. 
ostertagi (up-regulated transcripts only). Description: The number of 
transcripts in each stage associated with a specific GO Cellular 
component. 

Additional file 16: Prevalence of GO molecular function in O. 
ostertagi (up-regulated transcripts only). Description: The number of 
transcripts in each stage associated with a specific GO Molecular 
function. 

Additional file 17: Prevalence of Intepro domains in C. oncophora 
(up-regulated transcripts only). Description: The number of transcripts 
in each stage identified as a specific InterPro domain. 

Additional file 18: Prevalence of Intepro domains in O. ostertagi 
(up-regulated transcripts only). Description: The number of transcripts 
in each stage associated with a specific InterPro domain. 

Additional file 19: Depth of coverage of C. oncophora transcripts. 

Description: The calculated depth of coverage for all transcripts in C. 
oncophora. 

Additional file 20: Depth of coverage of O. ostertagi transcripts. 

Description: The calculated depth of coverage for all transcripts in 0. 
ostertagi. 

Additional file 21: Summary of comparison of differentially- 
expressed transcripts based on hypergeometric binomial 
distribution and EdgeR. Description: The number of transcripts 
identified as being differentially-expressed using the two different 
methods. 

Additional file 22: Differentially-expressed transcripts in C. 
oncophora and O. ostertagi using hypergenometric and EdgeR 
analysis. Description: List of all identified differentially-expressed 
transcripts in C. oncophora and 0. ostertagi using hypergenometric and 
EdgeR analysis. 
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